PrOntoLearn: Unsupervised Lexico-Semantic Ontology Generation using Probabilistic Methods
نویسندگان
چکیده
Formalizing an ontology for a domain manually is well-known as a tedious and cumbersome process. It is constrained by the knowledge acquisition bottleneck. Therefore, researchers developed algorithms and systems that can help to automatize the process. Among them are systems that include text corpora for the acquisition. Our idea is also based on vast amount of text corpora. Here, we provide a novel unsupervised bottom-up ontology generation method. It is based on lexico-semantic structures and Bayesian reasoning to expedite the ontology generation process. We provide a quantitative and two qualitative results illustrating our approach using a high throughput screening assay corpus and two custom text corpora. This process could also provide evidence for domain experts to build ontologies based on top-down approaches.
منابع مشابه
An Unsupervised Approach for Semantic Relation Interpretation
In this work we propose a hybrid unsupervised approach for semantic relation extraction from Italian and English texts. The system takes as input pairs of “distributionally similar” terms, possibly involved in a semantic relation. To validate and label the anonymous relations holding between the terms in input, the candidate pairs of terms are looked for on the Web in the context of reliable le...
متن کاملA lexico-semantic pattern language for learning ontology instances from text
The Semantic Web aims to extend the World Wide Web with a layer of semantic information, so that it is understandable not only by humans, but also by computers. At its core, the Semantic Web consists of ontologies that describe the meaning of concepts in a certain domain or across domains. The domain ontologies are mostly created and maintained by domain experts using manual, time-intensive pro...
متن کاملAn Intelligent Approach for Constructing Domain Ontology Using Art2 Neural Network and C-Value Method
Research on semantic webs has become increasingly widespread in the computer science community. The core technology of a semantic web is an artefact called an ontology. The major problem in constructing an ontology is the long period of time required. Another problem is the large number of possible meanings for the knowledge in the ontology. To overcome these problems, one approach is developin...
متن کاملSEMILAR: A Semantic Similarity Toolkit for Assessing Students' Natural Language Inputs
We present in this demo SEMILAR, a SEMantic similarity toolkit. SEMILAR includes offers in one software environment several broad categories of semantic similarity methods: vectorial methods including Latent Semantic Analysis, probabilistic methods such as Latent Dirichlet Allocation, greedy lexical matching methods, optimal lexico-syntactic matching methods based on word-to-word similarities a...
متن کاملOntology Enrichment for the Food Traceability Domain Using Romanian Lexico-syntactic Patterns
Ontologies are considered as the most important building blocks of semantic Web. Building such ontologies is a time consuming and difficult task, which requires a high degree of human intervention. In this paper we describe a method to facilitate the enrichment of Romanian language domain taxonomies by using a text-mining approach. We exploit Romanian domain specific texts in order to automatic...
متن کامل